Challenges You Will Face When Parsing PDFs with Python
theseattledataguy.comยท2hยท
Discuss: Hacker News
๐Ÿ“„PDF Archaeology
Top 11 Document Parsing AI Tools for developers in 2025
dev.toยท2dยท
Discuss: DEV
๐Ÿ“„Document Digitization
Point, Don't Point
ilovetypography.comยท2dยท
Discuss: Hacker News
๐Ÿ“œDocument Paleography
Converting a PDF to text locally with Ollama
huijzer.xyzยท2d
๐Ÿ‘๏ธOCR Verification
WorldCat Editions and Holdings Release
annas-archive.orgยท1dยท
Discuss: Hacker News
๐Ÿ“šMARC Records
OTW - Bandit Level 4 to Level 5
tbhaxor.comยท11h
๐Ÿ”งKAITAI
Bookends 15.2
tidbits.comยท1h
๐Ÿ“„PostScript
UTF-8 Is Beautiful
hackaday.comยท12h
๐Ÿ”ฃUnicode
Planarizing matchings
11011110.github.ioยท22h
๐ŸŽจGraph Coloring
Show HN: AI Ancestry Test โ€“ Free Ethnicity Prediction from Your Photos
attractivenesstest.comยท1dยท
Discuss: Hacker News
๐ŸŒCultural Computing
Did you solve it? The simple T-puzzle that fools everyone (at first!)
theguardian.comยท1h
๐Ÿ–‹Typography
Weasel words and co.: Guide to recognising AI-generated texts on Wikipedia
heise.deยท56m
๐Ÿ“œBinary Philology
Preserving the digital legacy of company archives: Last stop, Newhaven.
dpconline.orgยท9h
๐Ÿ’พData Preservation
Albania has appointed AI bot Diella as minister of public procurement
madcornishprojectionist.co.ukยท7h
๐Ÿ‘๏ธOCR Verification
Show HN: Semlib โ€“ Semantic Data Processing
github.comยท3hยท
Discuss: Hacker News
๐ŸŒณIncremental Parsing
How to Remove Invisible Characters From AI Text (Free Tool)
hackernoon.comยท1d
โœ๏ธOCR Correction
Ancient Scripts, Modern AI: Bridging the Divide with Morphology-Aware Tokenization by Arvind Sundararajan
dev.toยท1dยท
Discuss: DEV
๐Ÿ“Concrete Syntax
How to self-host a web font from Google Fonts
blog.velocifyer.comยท2hยท
Discuss: Hacker News
๐Ÿ”คFont Archaeology
Unsupervised Learning: Clustering
dev.toยท21hยท
Discuss: DEV
๐Ÿ“šDocument Clustering